• Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos 

      Li, Haoyuan; Dong, Haoye; Jia, Hanchao; Huang, Dong; Kampffmeyer, Michael Christian; Lin, Liang; Liang, Xiaodan (Journal article; Tidsskriftartikkel; Peer reviewed, 2024-01-15)
      Multi-person 3D mesh recovery from videos is a critical first step towards automatic perception of group behavior in virtual reality, physical therapy and beyond. However, existing approaches rely on multi-stage paradigms, where the person detection and tracking stages are performed in a multi-person setting, while temporal dynamics are only modeled for one person at a time. Consequently, their ...
    • DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment 

      Zhang, Xujie; Yang, Binbin; Kampffmeyer, Michael Christian; Zhang, Wenqing; Zhang, Shiyue; Lu, Guansong; Lin, Liang; Xu, Hang; Liang, Xiaodan (Journal article; Tidsskriftartikkel; Peer reviewed, 2024-01-15)
      Cross-modal garment synthesis and manipulation will significantly benefit the way fashion designers generate garments and modify their designs via flexible linguistic interfaces. However, despite the significant progress that has been made in generic image synthesis using diffusion models, producing garment images with garment part level semantics that are well aligned with input text prompts and ...